Characteristic Gene Selection via Weighting Principal Components by Singular Values

Authors

  • Jin-Xing Liu
  • Yong Xu
  • Chun-Hou Zheng
  • Yi Wang
  • Jing-Yu Yang
Abstract

Conventional gene selection methods based on principal component analysis (PCA) use only the first principal component (PC) of PCA or sparse PCA to select characteristic genes. These methods implicitly assume that the first PC plays a dominant role in gene selection. In a number of cases, however, this assumption does not hold, and the conventional PCA-based methods then tend to give poor selection results. To improve the performance of PCA-based gene selection, we put forward a gene selection method via weighting PCs by singular values (WPCS). Because different PCs have different importance, the singular values are used as weights that represent the influence of each PC on gene selection. The ROC curves and AUC statistics on artificial data show that our method outperforms the state-of-the-art methods. Moreover, experimental results on real gene expression data sets show that our method can extract more characteristic genes in response to abiotic stresses than conventional gene selection methods.
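The abstract does not spell out the scoring formula, so the following is only a minimal sketch of the general idea, under stated assumptions: the expression matrix is laid out as genes by samples, each gene is centered across samples, and a gene's score is the sum of its absolute loadings on each PC weighted by that PC's normalized singular value. The function names `wpcs_scores` and `select_genes`, the normalization of the weights, and the synthetic data are illustrative choices, not taken from the paper.

```python
import numpy as np


def wpcs_scores(X, n_components=None):
    """Score each gene (row of X) by singular-value-weighted PC loadings.

    Assumption: X has shape (n_genes, n_samples). This is an illustrative
    scoring rule, not necessarily the exact formula used by the authors.
    """
    # Center every gene across samples so the SVD corresponds to PCA.
    Xc = X - X.mean(axis=1, keepdims=True)
    # Columns of U hold the gene loadings; s holds the singular values.
    U, s, _ = np.linalg.svd(Xc, full_matrices=False)
    k = len(s) if n_components is None else min(n_components, len(s))
    # Normalized singular values act as the weights of the PCs.
    weights = s[:k] / s[:k].sum()
    # Weighted sum of absolute loadings, one score per gene.
    return np.abs(U[:, :k]) @ weights


def select_genes(X, n_select, n_components=None):
    """Return the indices of the n_select highest-scoring genes."""
    scores = wpcs_scores(X, n_components)
    return np.argsort(scores)[::-1][:n_select]


# Synthetic example (hypothetical data): 50 genes, 12 samples, low noise.
rng = np.random.default_rng(0)
X = rng.normal(scale=0.1, size=(50, 12))

# Two orthogonal, zero-mean sample patterns planted in three genes.
pattern_a = np.array([1.0] * 6 + [-1.0] * 6)
pattern_b = np.array([1.0, -1.0] * 6)
X[[3, 17]] += 5.0 * pattern_a   # genes 3 and 17 follow pattern A
X[42] += 5.0 * pattern_b        # gene 42 follows pattern B

selected = select_genes(X, n_select=3)
print(sorted(selected.tolist()))
```

In this toy setup, gene 42 varies along a direction orthogonal to the dominant pattern, so its signal sits mostly in the second PC. Combining several PCs, weighted by their singular values, lets such a gene contribute to the ranking, which is the situation the abstract describes when the first PC alone is not dominant.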

Similar resources

A new weighting approach to Non-Parametric composite indices compared with principal components analysis

The introduction of the Human Development Index (HDI) by UNDP in early 1990 was followed by a surge in the use of non-parametric and parametric indices for measuring and comparing countries' performance in development, globalization, competition, well-being, etc. The HDI is a composite index of three indicators. Its components reflect three major dimensions of human development: longevity, knowledg...

A Bayesian Shrinkage Approach for AMMI Models

Linear-bilinear models, especially the additive main effects and multiplicative interaction (AMMI) model, are widely applicable to genotype-by-environment interaction (GEI) studies in plant breeding programs. These models allow a parsimonious modeling of GE interactions, retaining a small number of principal components in the analysis. However, one aspect of the AMMI model that is still debated...

Free Vibration of Annular Plates by Discrete Singular Convolution and Differential Quadrature Methods

Plates and shells are significant structural components in many engineering and industrial applications. In this study, the free vibration of annular plates is analyzed. To this end, two different numerical methods, the differential quadrature and the discrete singular convolution methods, are used for the numerical simulations. Moreover, the frequency values are obtained v...

Mining large-scale Genomic and Proteomic Data: Algorithms, Tools and Inference

Motivation: Many methods have been developed for selecting small informative feature subsets in large noisy data. However, unsupervised methods are scarce. Examples are using the variance of data collected for each feature, or the projection of the feature on the first principal component. We propose a novel unsupervised criterion, based on SVD entropy, selecting a feature according to its contribution...

Block similarity in fuzzy tuples

A common problem in decision-making is to analyze a tuple of numerical values associated with options, such as the degree of satisfaction assigned by experts to alternatives or probability values for hypotheses computed from data. With no loss of generality, it is assumed that the tuple contains values in the unit interval. For post-processing of typical value(s), singular values that may arise...

Journal title:

Volume 7, Issue

Pages -

Publication date: 2012